Influence of Projection in Cluster Cosmology Studies 
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Projection tends to skew the mass-observable relation of galaxy clusters by creating a small 
fraction of severely blended systems, those for which the measured observable property of a cluster 
is strongly boosted relative to the value of its primary host halo. We examine the bias in cosmological 
parameter estimates caused by incorrectly assuming a Gaussian (projection-free) mass-observable 
relation when the true relation is non-Gaussian due to projection. We introduce a mixture model 
for projection and explore Fisher forecasts for a survey of 5000 deg^ to z = 1.1 and an equivalent 
mass threshold of 10^^'^ /i~^M0. Using a blended fraction motivated by optical cluster finding 
applied to the Millennium Simulation, and applying Planck and otherwise weak priors, we find that 
the biases in Que and w are significant, being factors of 2.8 and 2.4, respectively, times previous 
forecast uncertainties. Incorporating eight new degrees of freedom to describe cluster selection 
with projection increases the forecast uncertainty in S^de and w by similar factors. Knowledge of 
these additional parameters at the 5% level limits degradation in dark energy constraints to < 10% 
relative to projection-free forecasts. We discuss strategies for using simulations and complementary 
observations to characterize the fraction of blended clusters and their mass selection properties. 



I. INTRODUCTION 

Galaxy clusters offer tests of large-scale gravity and cos- 
mology, as their space density is exponentially sensitive to 
the time-dependent amplitude of the matter power spec- 
trum and the cosmic expansion history (see Allen et al. 
[ij and Voit for recent reviews). Because their counts 
and clustering probe the gravitational growth of struc- 
ture, clusters provide information beyond that provided 
by CMB and cosmic distance measurements such as su- 
pernovae and baryon acoustic oscillations. Joining cluster 
data with such measurements significantly improves cos- 
mological parameter constraints |3|. 

While the potential for clusters to constrain parame- 
ters such as the dark energy equation of state, the en- 
ergy densities of cosmic components, and the amplitude 
of matter density fluctuations has long been known 

early work also emphasized the importance of under- 
standing systematic errors associated with survey model- 
ing [e.g. , [3, fioj . The agreement in cosmological parame- 
ters derived recently from independent samples selected 
at optical [ll| and X-ray [T2| - f3 | wavelengths indicates 
progress in addressing systematic errors. However, early 
Planck analysis of the thermal Sunyaev-Zel'dovich (SZ) ef- 
fect in the optically-selected maxBCG sample [l5| suggests 
more work to be done; the mean gas thermal energies in- 
ferred from Planck measurements lie a factor of two below 
simple model expectations [l^ . While the origins of this 
disagreement are not yet fully known, the effective offset 
in total mass of ~ 40% is in ~ 2.5a conflict with the mass 
calibration errors quoted in the above cosmological stud- 
ies. The tension may be partly relieved by introducing a 
~ 20% offset in X-ray and weak lensing masses but 
other effects need to conspire and amplify this. 

The Planck optical results highlight the importance of a 
key functional element of cluster cosmology from counts. 



namely the likelihood, p{Mohs\M, z), that a halo of mass 
M at redshift z has an observed property Mobs- For SZ 
observations, A/obs is the total thermal energy of the hot 
gas inferred from the spectral distortion in the cosmic mi- 
crowave background. For the case of optical studies we 
consider Mohs to be the optical richness, defined by the 
number of red galaxies in the cluster above a given magni- 
tude limit. Sky counts expected within a particular model 
are calculated by a convolution of this mass-observable 
function with the space density of halos. The latter has 
been calibrated to high precision by N-body simulations 

MMM. 



Since the scaling of most observables with mass are 
power-laws, and therefore linear in the logarithm, the con- 
volution kernel is typically assumed to be log-normally 
distributed about a power-law mean. The log-normal 
assumption for deviations in hot gas properties about 
the mean is supported by X-ray observations of core- 
excised luminosity and temperature in clusters [22j and 
from a range of hot gas properties in simulated halo sam- 
ples [2^, [ijl- However, some degree of departure from 
log-normality should be expected intrinsically, potentially 
driven by different formation histories psj and by major 
merging events [26j . 

A given intrinsic likelihood for halo observables will 
be modified when Mobs is projected onto the sky (27l - 
[29j . Halos projected along the line-of-sight of a given 
target boost its signal in a stochastic manner, resulting 
in a P(Mobs|M, z) that grows a tail to high values. Opti- 
cal richness, an attractive property to measure because 
it only requires broad-band photometry, is sensitive to 
line of sigh t proje ctions. Richness scales roughly linearly 
with mass p30l - [33 w hile X-ray and SZ signals scale more 
steeply, ~ M^-^ (33l - l36i |. making these observables less sus- 
ceptible to contamination from (spatially more abundant) 
lower mass halos. Indeed, the Abell catalog [13, con- 
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tains cautionary notes about projected confusion. Spec- 
troscopic studies of optically selected clusters occasionally 
reveal multiple peaks and complex structures in velocity 
space_(A1689 [p, CL1604 0: A85 [HI and EIS clus- 
ters [42|), and simulation studies are beginning to explore 
these issues in detail [H, 131 ■ Joint X-ray and optical 
studies of three nearby clusters show thermal signatures 
anticipated by gas dynamic simulations for actively merg- 
ing systems [45|. 

The statistical ingredients (the space density, spatial 
clustering, and galactic content of halos) needed to cal- 
culate projected confusion are coming into focus, and a 
generic expectation is that most massive halos suffer lit- 
tle contamination while a modest percentage are strongly 
affected by projection [l^H^. These studies motivate a 
Gaussian mixture model for projection that we explore in 
this paper. The mixture represents a dominant component 
of clusters whose sightlines are largely clean along with a 
minority of clusters whose signal is strongly boosted. The 
latter category we refer to as blended systems, or blends, 
and in these objects the Mobs signal is not dominated by 
a single halo. Our treatment here is intended to be illus- 
trative, but model parameters could be tuned using sky 
simulations tailored to specific surveys [13] . 

An earlier study of projection used an Edgeworth expan- 
sion to model cluster counts including non-zero skewness 
and kurtosis in p(Mobsl-M, z). Shaw et al. [i^ find that 
the detailed shape becomes important when the product 
of the scatter in the mass-observable relation and the log- 
arithmic slope of the mass function at the limiting mass 
is greater than one. Our study differs from that work 
in two ways: our Gaussian mixture approach, which in- 
cludes eight new degrees of freedom, is more general than 
their expansion, and we use a Fisher matrix approach to 
explicitly calculate both the bias that projection induces 
in a projection-free (single component Gaussian) analysis 
and the additional variance that is incurred when the ex- 
tra degrees of freedom are included. We explore the latter 
under a variety of prior constraints. 

The paper is organized as follows. In ^11 Dl we briefly 
recount the procedure for how to extract dark energy 
constraints by computing cluster counts and variance in 
counts, and present our parameterization of the mass- 
observable relation including the line-of-sight projection. 
In mill we present our results and, in i jlVl a discussion of 
the results. 



II. CLUSTER SELECTION MODEL WITH 
PROJECTION 

When Abell published the first homogeneous cluster 
catalog from photographic plate imaging, he employed the 
count of galaxies within a fixed metric aperture and a 
scaled magnitude range as a measure of galactic richness, 
a proxy for cluster mass [37j . The development of multi- 
band imaging cameras in the late 1990s [491 [soj enabled 
cluster samples to be selected using color selection tech- 



niques, whereby counts within a joint magnitude and color 
(or photometric redshift) range are employed as a mass 
proxy [TBI- Isil - Issj . These samples contain up to 69000 clus- 
ters extending to z ^ 1 across nearly 8000 deg^ of sky. The 
next generation of optical and near-infrared surveys — the 
Dark Energy Survey ^ (DES), the VISTA surveys^, Pan- 
STARRS'\ with Large Synoptic Survey Telescope^ (LSST) 
and Euclid^ to follow — will identify hundreds of thousands 
of clusters. 

Modern, color-based cluster finders rely on the 4000 A 
break feature of old stellar populations [5a] . Observations 
show, and stellar population models expect, that the mean 
color in a fixed observed band straddling 4000 A will vary 
with redshift. A single color can therefore be used as a 
simple photometric redshift estimator [53|. The redshift 
accuracy is limited by the finite color width of the red 
galaxy population at a given epoch. The finite width of 
the color filter employed for cluster finding in turn cor- 
responds to a comoving length scale of order hundreds of 
megaparsecs [1^. Red galaxies in spatially distinct halos 
that fall within a cylinder of this length aligned toward 
an observer will be catalogued as a single cluster (58| . 
We generically refer to this process as blending. While 
all clusters suffer some degree of projected blending, we 
are particularly interested in extreme cases, and so adopt 
a specific definition for classifying clusters. A cluster of 
observed richness Afobs will be referred to as a blended 
cluster if no single halo contributes Mobs/ 2 or more to 
the richness. Conversely, a cluster for which a single halo 
does contribute > Mobs/2 of the richness is referred to as 
clean. (We assume here that the radial scale for observed 
and intrinsic measures are aligned.) 

The massive halos that host clusters tend to be embed- 
ded in filaments and/or supercluster regions. Viewpaths 
that traverse such structures will have a locally boosted 
background in the color-magnitude sub-space used for 
cluster detection. Empirical studies of a new red se- 
quence matched filter method applied to SDSS maxBCG 
data [H, [sl] indicate that such boosts generate a blended 
fraction of ~ 10% in the cluster population. 

Existing Fisher matrix forecasts for the cosmological pa- 
rameter yield from upcoming surveys [^-d, [60t have as- 
sumed a log-normal distribution for the observable likeli- 
hood, p(Afobs|M, z). While the log-normal form may re- 
flect the intrinsic (e.g., spherically averaged) dispersion in 
the halo ensemble, blended clusters will have p(Mobs|M, z) 
strongly boosted at high Mobs- We use a Gaussian mix- 
ture model in log-mass, described in Hll Bl to model a bi- 
modal cluster population consisting of clean and blended 
systems. 

While our model is general, we tunc default parameters 



^ Ihttp : //www . darkenergysurvey . org/ 1 

^ http : //www . eso . org/sci/observing/policies/PublicSurveys 
^ http: //pan-Starrs . if a. ha waii . edu/public/| 
* http://www.lsst.org/lsstT 
^ jhttp: //sci . esa. int/euclld] 
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using the results of Cohn et al. [2^. That study apphcs 
a red sequence-based algorithm to proj ected galaxy maps 
from the Millennium Simulation [6l|,l62l- They use a single 
R — z color applied in narrow redshifts sliced centered at 
z = 0.4, 0.7 and 1.0. Matching halos to clusters by galac- 
tic membership, they identify a blended subset of clusters 
whose mass-observable relation is shifted to higher Mobs 
values and whose variance is larger than that of clean clus- 
ters. At higher redshifts, the mean color in the old stellar 
population varies more weakly with z, and the color width 
of the red sequence traces out an increasingly longer co- 
moving cylinder, reaching ^ 500 Mpc at z = 1. The 
longer search cylinder drives an increase in the blended 
fraction of clusters, from 11% at z = 0.4 to 22% at z = 1. 
Note that the fraction of halos at fixed true mass that 
are blended will be lower than this, as convolution with 
a steeply falling mass function increases the fraction of 
blended clusters at fixed Mobs |46l . 



A. Reference Model Survey 

Our reference model survey, based on DES-|-VISTA^, 
is assumed to cover 5000 square degrees and extend to 
a limiting redshift of Zmax = 1-1. Our choice of maxi- 
mum redshift is somewhat conservative since with the ad- 
dition of the IR filters from VISTA, the combined sur- 
veys should have accurate redshifts for field galaxies up to 
z - 1.5. . We assume that DES-h VISTA- VHS wiU detect 
clusters above an observed threshold. Mobs > Afth, with 
Mth = lO^^'^/i~^M0, comparable to what is achieved by 
low redshift surveys [HI, [SO] ) . Based on the maxBCG Agai 
richness measure, the zero-redshift variance in the mass- 
observable relation is taken to be CTq = 0.25 [63j . 

We subdivide the sky into 500 bins of 10 square de- 
grees each, and calculate the counts and sample vari- 
ance using richness bins of width AinMoba = 0.2 with 
the exception of the highest mass bin, which we extend 
to infinity. We set the width of our redshift bins to 
Az = 0.1. These bin sizes imply 11 redshift bins and 
10 mass bins. Wc assume fiducial cosmological parame- 
ters based on the fifth year data release of the Wilkinson 
Microwave Anisotropy Probe (WMAP5, ^^). Thus, we 
set the baryon density, VLbh? ~ 0.0227, the dark matter 
density, ft„ih^ = 0.1326, the normalization of the power 
spectrum at fc = 0.05Mpc"\ 5^ 4.625 x 10"^, the tih, 
n = 0.963, the optical depth to rcionization, r = 0.087, 
the dark energy density, Hue = 0.742, and the dark energy 
equation of state, w ^ —I. In this cosmology, erg = 0.796. 
With the exception of w, the cosmological parameters we 
use have been determined to an accuracy of a few percent. 
We apply Planck priors^ to all cosmological parameters. 
We use CMBfast [651, version 4.5.1, to calculate the trans- 



fer functions. 



B. The mass— observable relation for clean clusters 

Wc assume that the majority of clusters are clean sys- 
tems whose selection properties are described by a single 
log-normal form. Following the notation of [66j . we write 
the probability of observing a cluster with observable mass 
proxy. Mobs given a true mass M, as 



p(Mobs|M,z) 



1 



V27r<,, (M,z) 



exp [-x^] (!) 



with 



InMobs - InM - InMbias (M, z) 

X = , . (2) 

V2<m(M,z) 

The model allows for systematic error in the observable 
by allowing rcdshift-dcpendent bias and variance 

lnMbias(M, z) = Bo + Biln(l + z), (3) 

3 

<MiM,z)=al+Y,s,z\ (4) 



We set the fiducial values of So, Bi and the Si to zero 
throughout this paper. The baseline mass scatter, ctq, 
is taken to be 0.5, a value consistent with maxBCG find- 
ings for that survey's original iVgai richness estimator 
63 1 . Recently, Rykofi^ et al. [5^ proposed an improved 
mass-estimator for MaxBCG, with scatter expected to be 
0.2 = 0.3, making our assumption about the scatter con- 
servative. 

Below, we apply this single Gaussian model to fit a set 
of data that are described by our extended, two Gaussian 
case. For that fit, ctq has a slightly different value, and 
-Bq, Bi and the Si elements will be non-zero, as described 

in fjnis 



^ |http : //www ■ vista ■ ac ■ uk/ 1 , |http : //www . darkenergysurvey ■ org/ 1 

Planck Fisher matrix courtesy of Wayne Hu. 



C. Selection with projection: blended clusters 

To model selection with projection, we use a Gaussian 
mixture form for Mobs that combines clean and blended 
sub-populatons, 

p(Mobs|M, Z) = (1 - 7(z)) Gclcan + -f{z) Gblond (5) 

where Gcioan and Gbicnd are log-normal distributions of the 
form given by Eq. ([T|), and the blend factor, 7(z), controls 
the fraction of blended clusters. 

For the component representing blends, we introduce a 
set of parameters for the bias and scatter different than 
that of the clean component, 

lnMbias,b(z) = /io + aln(l + z) + /3 (ln7\/ - InAIth), (6) 
<^inM b(-^^ = <^oh + + SM (InM - InMth). (7) 
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We highlight below the role of the mass bias terms, espe- 
cially the constant offset, fXo, and its logarithmic redshift 
gradient, a. The parameter /3 allows for a mass-dependent 
bias. For the scatter of the blended component, we focus 
on a pessimistic scenario where ctq = 2(7g . This is con- 
sistent with results derived from Millennium Simulation 
analysis [l^ . The more optimistic case of ctq = CTq yields 
qualitatively similar results. The variance is allowed to 
evolve linearly with redshift and log-mass. 

Default parameter values for the blended component 
model are /3 = = = 0, and o-p 2cr§. We 
consider three specific combinations of fiQ and a that re- 
flect different scenarios of redshift evolution in the bias of 
the blended component: none {a = 0, no = 0.75); weak 
{a = 0.5, /xo = 0.5); and strong (a = 1, /io = 0.25). In 
all cases, the log-mean halo mass of blended clusters is 
biased low, by /io + Q;ln(l + z) relative that of the clean 
component. 

The blend factor controls the overall fraction of blended 
clusters, and we write its evolution as 



7(z) = 70 + 7iln(l + z)e~ 



(8) 



where the exponential damping is added only to regularize 
7 at high redshift. The blend factor grows with redshift to 
z = 0.77, then flattens and decreases weakly toward the 
z = 1.1 redshift limit. 

We choose this parameterization because it allows suf- 
ficient freedom to roughly match the blending fraction as 
a function of redshift found in Cohn et al. We cal- 

culate the blended fraction of clusters as a function of 
their observable mass proxy. Mobs, via convolution with 
the mass function, as described below. In Fig. [1] we show 
the resulting fraction of blended clusters above the survey 
threshold. 



N- 



blend 



blend 



N 



blend 



(9) 



lean 



as a function of redshift bin for 70 = and 71 ~ 0.05, 
0.15 and 0.25. The mean counts. A'', are given by Eq. pT|) 
below, where the clean and blended components are cal- 
culated using the associated components of Eq. ([5]). For 
each 71, the three lines show results for the three choices 
of {/i n, a } pairs discussed above. The results of Cohn 
et al. [2^, shown as the three black dots in the figure, are 
roughly matched by the choice of 71 = 0.15. 



D. Cluster counts and clustering 

The subject of deriving cosmological constraints from 
cluster number counts and clustering of clusters has been 
treated extensively in the literature (see, e.g., [sl. [60I IgtI. 
[68t). We give a brief summary in this section, following 
the approach described in [g^l, and leave other details to 
Appendix [A] 

The number density of clusters at a given redshift z with 
observable in the range Ma < Mobs < Ma+i is given by 




71 = 0.25 



71 = 0.15 



71 = 0.05 



FIG. 1: The fraction of blended clusters above the survey Mobs 
threshold is plotted for 70 = and 71 = 0.05, 0.15, 0.25 (bottom 
to top). The three black dots are the values found from the 
Millennium Simulation study of Cohn et al. [28l |. Color-styles 
correspond to three different redshift dependence forms (none, 
weak, strong) for the bias of the blending amplitude: /io ~ 
0.75, a = 0.0 (dotted, blue); /to ~ 0.5, a = 0.5 (dashed, green); 
/to = 0.25, a = 1 (solid, red). The models are tuned to coincide 
near the median cluster sample redshift of z = 0.65. 



riaiz) 



Ma 



+1 dM, 



obs 



Mobs 



dM dn 
M d\nM 



p(Mobs|M,z). (10) 



where -j^j is the mean halos space density, also called the 
mass function. We use the Tinker parameterization for the 
mass function, and ignore errors in redshift estimates. The 
mean cluster number counts, and sample covariance, in 
bins labeled by i = {a, b, c}, where a denotes mass proxy, 
b redshift, and c angular coordinate, are given by 



N,, 



dz^naW'.'^in) 
dz 



(11) 

(12) 



where Wc*^{n) is an angular top-hat window function. 
Define the covariance matrix of halo counts 



Si' 



N,S,j 



(13) 



where Ni is the vector of mean counts defined in Eq. ([TT|) 
and Sij is the sample covariance defined in Eq. (fT2|) . The 
indices i and j refer to observable, redshift and angular 
coordinate bins. Assuming Poisson noise and sample vari- 
ance are the only sources of noise, the Fisher matrix is. 



F^p = n'^C-'N,0 + ^Tr {C-iS,„C-iS.0} , (14) 

where commas denote derivatives with respect to the 
model parameters. The first term on the right-hand side 
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contains the information from the mean counts, TV. The 
Sij matrix can be thought of as contributing noise to this 
term, and hence only reduces the information content from 
counts alone. The second term contains the information 
from the sample covariance. 

The marginalized error in a parameter is given by 
^{Oa) = [{F^^)aa]^^^ ■ Priors are easily included in the 
Fisher matrix. If parameter 6i has a prior uncertainty of 
a{6i), we add a~'^{9i) to the Fa entry of the Fisher matrix 
before inverting. 



III. RESULTS 

Our model with projection differs from previous models 
that assume an entirely clean (single log-normal) popula- 
tion. Applying a clean-only model to a sky with projection 
will generally introduce a bias into derived cosmological 
parameters. We first address the magnitude of this bias, 
then turn to the impact that introducing extra degrees 
of freedom to represent blends has on marginalized con- 
straints of dark energy parameters. 



A. Parameter Bias 

To estimate the bias in cosmological constraints that 
would result if cluster samples with projection are an- 
alyzed using a model with no projection, we follow a 
linearized approach used in previous studies [tiI It^ . 
Our "true" sky counts are based on the bimodal mass- 
observable relation, Eq. [5l applied using the three redshift 
evolution cases for the mass bias of the blended component 
(none, weak, strong) discussed above. The redshift growth 
rate of the blending factor, 71, is a controlling degree of 
freedom. 

If the true sky is analyzed assuming no projection, 
meaning using a unimodal mass-observable relation equiv- 
alent to a "f{z) = assumption in Eq.O then the resultant 
projection-free counts, A^i, and sample covariance Si may 
differ from the true values of N and S, respectively. The 
set of model (cosmological -I- mass-observable) parame- 
ters, 6, recovered will generally differ from that of the 
true model. The bias in the model parameters is given by 



+ iTr{Ci-iSi,0Ci-i(S-Si)}]. (15) 

The covariance and Fisher matrix in the above expres- 
sion are evaluated for the projection-free model using pa- 
rameter values determined by fitting the redshift behav- 
ior of the first two moments of the mass-observable rela- 
tion with projection. For a specific choice of true model 
parameters ^0, a, 71, (and fixing J0t/3,Sz,sm = and 
(Tq = 2(Tq), we compute the mean mass and variance in 



redshift bins of width 0.1 and fit these to determine the 
terms Bq, Bi, si, S2, S3 of the unimodal model, Eqs. ^ 
and ([7]). Values for the case of 71 = 0.15 are given in Table 

in 



TABLE I: Projection-free mass-observable parameters fit to 



case 


with projection for 


71 = 0.15. 






IJ.0 


a 


Bo Bi 


2 

(T(, Si 


S2 


S3 


0.75 
0.50 
0.25 


0.00 
0.50 
1.00 


0.0076 0.0389 
0.0040 0.0470 
0.0004 0.0551 


0.2503 0.1110 
0.2500 0.0760 
0.2499 0.0475 


-0.1299 
-0.0497 
0.0111 


0.0491 
0.0097 
-0.0163 



For 71 = 0.15, the shifts in the mean mass are below 
one percent at z = but grow to 3.8% at z = 1 for the 
strong blending evolution case (a = 1). The mass bias 
fit, constrained by the form of Eq. ^ with only two free 
parameters, can differ from the true bias in the projection 
model by up to 0.007 at z = 1 when the fit is the worst (in 
the a = 0) case, but only by 0.002 for the best fit (a — 1) 
case. The redshift behavior of the variance, with four free 
parameters of Eq. ([4]), matches the values of the projection 
case quite well, with deviations less than 3 x 10^^ in the 
worst case. The variance at z = 1 is larger for larger values 
of a, with = 0.293 for the a = 1 case. 

For smaller 7 values, the fits deviate less from the true 
bias in projection. For comparison, the bias at z = 1 for 
a 71 = 0.05 fit, differs by 0.002 in the worst case and the 
mass bias is about 0.03. The variance also grows more 
slowly with z, with <t^ = 0.264 a.t z ~ 1 for the a = 1 
case. 

We compute survey expectations for counts (A) in mass, 
angle and redshift bins and their covariance (S) for the 
range < 71 < 0.3. We then calculate the counts (A^i), 
sample covariance (Si), full covariance (Ci^^), and Fisher 
matrix (Fi) for the respective projection-free case using 
the best-fit parameters described above. As mentioned in 
Sec. Ill A[ we add unbiased Planck priors to the Fisher 
matrix, so that Fi ^ Fi + i^pianck- The resultant values 
are used to compute bias in model parameters according 
to Eq. p5l). 

Fig. [2] shows the resulting biases in w and I^de- For the 
cases shown, we assume a Planck prior on the cosmolog- 
ical parameters but all other model parameters are free. 
When 71 = there is no blended component and there- 
fore no parameter bias (note we assume 70 = 0). The bias 
in cosmological parameter estimates grows approximately 
linearly with 71, with slopes that are weakly dependent 
on the assumed redshift evolution of the mass bias in the 
blended component. For the strong redshift evolution case 
{fiQ = 0.25, a = 1.0) with 71 — 0.17, which provides a close 
match to the simulation results of Cohn et al. [l^, we find 
a significant biases in the dark energy equation of state, 
Sw = 0.12, and in the dark energy density, 50de = —0.04. 

These shifts may be considered pessimistic, in the sense 
that we have assumed a large scatter in the blended com- 
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FIG. 2: Shifts in the cosmological parameters w (upper lines) 
and —Ode (lower lines) as a function of the blending evolu- 
tion rate, 7i. Line-styles correspond to three different redshift 
dependence forms (none, weak, strong) shown in Fig.IU 



ponent. For the case of cTbiond = '^O' ^^e slopes of the 
equivalent lines in Fig. [2] are reduced by ^ 50%, so that 
the strong redshift evolution case with 71 = 0.17 produces 
6w = 0.08 and SV,ue = —0.03. Reducing the assumed 
Co = 0.5 scatter in the clean component would also lead 
to smaller biases in cosmological parameters. 

TABLE IL Cosmological parameter shifts, SO, for strong red- 
shift evolution (/io = 0, a = 1.0) and 71 = 0.17. 



Parameter 


^truc 


se 




0.0227 


-0.0001 




0.1326 


0.0009 




0.742 


-0.0401 


W 


-1.0 


0.1178 




4.625 


0.0222 


n 


0.963 


-0.0015 


T 


0.087 


1.0 X 10"'^ 



Table |TT] shows the bias in all cosmological parameters 
for strong redshift evolution for 71 = 0.17, the case that 
best matches Cohn et al. [2^. The bias for parameters 
other than JIde and w is less than 1% of the fiducial value. 
However, comparing to the fiducial uncertainties from the 
Fisher matrix with unbiased Planck priors show that the 
shifts can approach a 1-a level for flmh'^ and (5^. 

Fig. [3] offers insight into the magnitude of the change 
in cluster counts arising from projection. As a fidu- 
cial measure, we use counts, Na^, for the projection- free 
(uniniodal) case with default parameters (zero bias and 
redshift- independent variance). The solid lines in Fig. [3] 
show the fractional shifts in counts, relative to the fidu- 
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FIG. 3: Fractional change in counts in the strong redshift evo- 
lution case (/io = 0.25,0: = 1), relative to a project ion- free 
model, are shown for three values of the blending evolution 
parameter, 71. Solid lines give the case with projection while 
dotted lines show the projection-free model with parameters 
tuned to match the mass bias and variance of the projection 
model, but with cosmology fixed at the fiducial WMAP5 val- 
ues. Dashed lines show the projection-free case after shifting 
all parameters (cosmological and mass-observable) according 
to Eq. (fT5)). 

cial, as a function of redshift for the projected (bimodal) 
cases with = 0.25, a = 1.0. For 71 > 0.1, projection 
boosts counts on the order of a few tens of percent at high 
redshift. The dotted lines show projection-free expecta- 
tions when the mass-observable parameters are shifted to 
the values given in Table HI but the cosmology is held 
fixed. The dashed lines give projection- free expectations 
when both cosmological and mass-observable parameters 
are adjusted according to Eq. ([T5|). 

The counts of the projection-free model with fully 
shifted parameters provide a good match to the counts 
with projection. The adjustment of the mass-observable 
parameters alone offers a good match at low redshifts, but 
at high redshift, a unimodal fit to the bimodal form of 
the projected p{Mohs\M, z) becomes increasingly less ac- 
curate. Adjustments in cosmological parameters shift the 
amplitude and shape of the mass function as a function of 
redshift, providing a degree of compensation for deficien- 
cies introduced by a unimodal p{Mo\ys\M,z) assumption. 
While the quality decreases for higher values of 71 , the fits 
arc still acceptable in a sense. 

Note that as 71 grows and the associated shifts in pa- 
rameters grow, the linear approximation for the bias given 
by Eq. ([15]) begins to break down. For 71 = 0.05 agree- 
ment between the shifted single-Gaussian case and the 
two-Gaussian case is quite good, while at 71 = 0.25 the 
divergence is much larger. 

Finally, we note that Eq. ([15]) calculates shifts using 
the Fisher matrix of the projection-free model. We have 
verified that we obtain the same results if we employ the 
projection model matrix with sharp priors added to the 
eight parameters describing the blended component. This 
is expected because, for the same free parameters in the 
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two models, the linearized equation should be symmetric 
under their exchange. 



B. Dark Energy Parameter Constraints 

While introducing additional parameters to describe se- 
lection with projected blending may eliminate bias in cos- 
mological parameters, that benefit comes with the risk 
of degrading cosmological parameter constraints. The 
amount of degradation depends on assumptions about pri- 
ors on model parameters. 

Table Hill summarizes results using the projection model 



2c^o, 70 



28| 



that corresponds to our best match of Cohn et al 
(lio = 0.25, a = 1.00, l3,Sz,SM = 0, crg^, 
0, 71 = 0.17). In all cases, Planck priors are added to the 
cosmological parameters, and we consider priors on the 
clean and blended cluster components separately. Given 
an assumed prior error, ai, on the i^^ parameter, we add 
to the Fisher matrix 



prior \ ^ 



(16) 



We consider sharp priors as being numerically larger than 
other entries in the Fisher matrix, generally i^sharp — 10^, 
and flat priors are given by i^flat = 0. 

TABLE III: Forecasts for w and JIde constraints using Planck 
priors. 



Priors 


Uncertainty 


Clean Blended 


a{QoE) a{w) 


sharp sharp 
flat sharp 
flat fiat 


0.002 0.011 
0.014 0.046 
0.034 0.109 


0.1 sharp 
0.1 0.1 
0.1 flat 


0.010 0.030 
0.010 0.030 
0.013 0.042 



Table Hill shows permutations of three basic cases: a flat 
prior on model parameters, a prior of a = 0.1 added to 
model parameters as well as a 10% prior added to CTq or 
CTg or sharp priors on model parameters. The last two 
columns give the marginalized uncertainty in w and Que- 

The first three rows compare extremal cases. Sharp 
knowledge of all mass-observable parameters produces the 
best constraints possible, ±0.002 in Hue and ±0.011 in 
w. The projection-free case with no prior knowledge of 
the six parameters of the clean component, shown in the 
second row, produces constraints of ±0.014 in Que and 
±0.046 in w. These errors are worse by factors of 7 and 4, 
respectively, than the case of perfect knowledge. Introduc- 
ing eight new degrees of freedom to represent the blended 
component further degrades the errors by somewhat more 
than a factor of two, to ±0.034 in Que and ±0.11 in w. 



Targeted follow-up and complementary survey informa- 
tion, from mm or X-ray observations for example, may en- 
able moderate priors to be placed on the bias and variance 
of the mass-observable relation. These cases are explored 
in the lower three rows of Table IIIII Knowledge of the 
clean component parameters at the level of ±0.1 provides 
substantial improvement over the flat case. Even with no 
prior knowledge of the blended component, the errors of 
±0.013 in Que and ±0.042 in w represent improvements 
over the projection-free case with no prior knowledge (sec- 
ond row). When 0.1 priors are brought to bear on the pro- 
jected blends, the constraints improve to ±0.010 in Que 
and ±0.030 in w. Stronger priors on the blended compo- 
nent do not improve these constraints. 



C. Discussion 

Achieving constraints on w and f^DE at the few per- 
cent level is a goal of next-generation cluster surveys. Our 
analysis shows that avoiding biases at this level requires 
projection to be incorporated into the likelihood modeling 
of optical-IR surveys. 

Prior knowledge of the blended component behavior 
can limit parameter bias. In Fig. |31 we illustrate the 
rate at which the forecast uncertainty in w changes with 
prior uncertainty on the mass-observable parameters of 
the blended component. The behavior for f^DE is similar, 
mainly because Planck priors effectively fix many of the 
correlations between cosmological parameters. The solid 
line shows the effect of applying priors to all eight param- 
eters while dashed lines show the behavior when priors are 
applied only to parameters controlling the blending am- 
plitude (70, 71), mass bias terms (/io, a, /3) and mass 
variance (crinAf.b) Sz, Sm)- In all cases, flat priors are 
imposed to the remaining mass-observable parameters. 

Applying priors to only the parameter subsets, we see 
that all three sets have comparable effects on w uncertain- 
ties. Because of covariance, the effect of applying strong 
priors to all parameters is much stronger than for any of 
the isolated sets. The error in w is somewhat more sensi- 
tive to the blending amplitude 7(2:) than to the bias and 
variance, but all parameters need to be known at the level 
of 0.1 in order to avoid significant degradation. 

For red sequence or photometric rcdshift cluster find- 
ing methods, the fraction of blended clusters is not likely 
to dominate the population, suggesting that the current 
level of uncertainty is near 0.1. By testing the perfor- 
mance of cluster finding algorithms on sophisticated simu- 
lations, and by calibrating mass selection based on multi- 
wavelength follow-up campaigns of existing deep cluster 
catalogs, the error may be reduced to ~ 0.05 as part of 
next generation survey analysis. In the right-hand panel 
of Fig. 31 we show the error in w for cases in which just 
the amplitude or both the amplitude and mass bias of the 
blending component are known at the ^ 0.05 level. These 
cases limit the degradation of the w constraint, to factors 
of 2.0 and 1.5, respectively, compared to 2.4 for the case 
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FIG. 4: Error in w is shown as a function of priors added to the parameters describing projected blends. Planck priors are apphed 
to cosmological parameters throughout. The left panel shows the error in w as a function of prior ct; applied to all eight blending 
parameters (black solid), and separately to only the parameters describing 7(2;) (red dashed), or only the parameters describing 
InMbias.b (green dot-dashed), or only the parameters for o■f^J^J^^ (blue dotted). The solid black line is reproduced in the right 
panel, which also shows the resulting error in w when a 0.05 prior is applied to just the blending amplitude, 7(a) (blue dashed), or 
when 0.05 priors are applied to both 7(2) and InMbias.b (red dotted). In the latter two cases, the results are shown as a function 
of prior at applied to the remaining blending parameters. 



of all parameters free. 

Our assumed value of ctq = 0.5 may be pessimistic, in 
that future cluster finders may achieve better mass selec- 
tion. An improved matched filter method [59| applied to 
the maxBCG catalog indicates a mass scatter closer to 0.3 
for low redshift clusters for a sample with mass thresh- 
old close to the value assumed here. While achieving this 
level of mass selection at 2; > 0.5 has not yet been demon- 
strated, the variety of cluster multi-color detection algo- 
rithms under active development [H, [13, [H, [H, [73l - [76| 
offer the potential of future gains. 

Spectroscopic observations of cluster fields provide valu- 
able empirical tests of blending. For example, a follow-up 
of 58 EIS cluster candidates, selected only with I-band 
imaging and so without the benefit of color-based red- 
shift filtering, found multiple redshift-space structures in 
a majority of fields [12] . Studies at high redshift using in- 
frared color selection, which are just beginning, fare bet- 
ter but are not entirely clean. Six rich clusters from the 
SPARCS sample, which uses z'-3.6/i color from ground- 
based and Spitzer observations, have been followed up 
with Keck/LRIS spectroscopy. Two cases appear to be 
strongly blended, with dynamical mass estimates derived 
from velocity dispersions lower by a factor ^ 6 than mass 
estimates based on their galactic richness. Continued 
follow-up of this and other IR-selected deep cluster sam- 
ples should be followed vigorously as a means to charac- 
terize the amplitude and mass scale of projected blends. 

Simulations of large-scale structure provide an effec- 
tive tool for understanding projection. Work is underway 
within the DES collaboration to test a variety of cluster 
finding algorithms against simulated expectations for the 
multi-band galaxy catalog. Using either galaxy member- 
ship or redshift-space location as a way to match clusters 



and halos, the simulations offer the means to test the sen- 
sitivity of blending to algorithm choice and to choice of 
parameters within a fixed algorithm [77| . Such studies 
should produce improved, algorithm-specific characteriza- 
tions of blends that can be coupled to empirically-derived 
studies to serve as prior information for cluster likelihood 
analysis. 

As algorithms improve in terms of mass selection, char- 
acterization of projection effects will inevitably become 
apparent through the full shape of the mass-observable 
relation, p{Mo]^s\M, z) or its inverse, the mass-selection 
function, p(M|Mobs) z). Ultimately, survey constraints on 
dark energy parameters have the potential to achieve the 
best possible constraints given by the first row of Table Hill 
Extracting a one percent constraint on w poses the chal- 
lenge of precisely characterizing selection. More careful 
analysis may suggest an improved, possibly more compact, 
form for modeling selection with projection than what we 
present here. 

While we focus our analysis on optical-IR studies, the 
issue of blending is generic to all cluster finding methods. 
The blending factor 7(2) should be minimal for X-ray se- 
lection, due to the compactness of the surface brightness 
image as well as the strong scaling of luminosity with mass. 
SZ selected samples are likely to incur blending at a level 
below that of optical-IR surveys However, for X-ray 
and SZ, angular resolution is also an important factor. 
The Planck satellite has only moderate resolution of 5-10', 
depending on frequency. Of the 21 new cluster candidates 
identified in the Planck ESZ sample, four are known to 
be double or triple systems from XMM follow-up imag- 
ing [tII. Follow-up studies of these and other SZ-selectcd 
sources from SPT and ACT is needed to characterize the 
mass selection of these methods. 
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IV. SUMMARY 



Appendix A: Cluster Counts and Clustering 



Cluster counts used in cosniological studies have typ- 
ically been modeled with log-normal deviations about 
power-law forms for the mass-observable relation. While 
a log-normal expectation may reasonably reflect intrin- 
sic scatter, projection will generically boost a minority of 
systems to higher observed signal. This blending of halo 
properties is particularly true for optical-IR surveys that 
use color or photometric redshifts as a distance estimator. 
We extend previous Fisher matrix studies by introducing 
a Gaussian mixture model for the mass-observable rela- 
tion. The model employs eight new parameters to describe 
a redshift-dependent amplitude and shape of the blended 
component, in addition to the six parameters of the dom- 
inant, non-blended cluster population. 

The presence of a minority of strongly blended clus- 
ters influences cosmological parameter estimation. For the 
case of blendiiig parameters tuned to Millennium Simula- 
tion analysis [2^ (Fig. [T]), we find that survey analysis 
using a projection-free (single Gaussian) analysis model 
introduces biases of 0.1 in w and —0.04 in fl^E- Compar- 
ing their Fisher forecast errors with Planck , these shifts 
are comparable to uncertainties expected using flat pri- 
ors on mass-observable parameters, but are an order of 
magnitude larger than the uncertainties possible under 
precise mass-observable knowledge. Explicit modeling of 
projection is therefore required to avoid significant bias in 
next generation cosmological studies using cluster counts 
and clustering. Optical studies at low redshift, where the 
blended fraction is below ten percent, or studies using 
cleaner detection methods, such as X-ray selection, are 
less susceptible to bias from projection. 

Constraints on w and JIde with Planck priors degrade 
by about a factor of 2.4 when new parameters to describe 
the Gaussian mixture distribution are included. Our anal- 
ysis indicates that 5% prior knowledge of the blending am- 
plitude and mass bias limit the degradation to a factor of 
1.5. 

Improved knowledge of blending will come from com- 
plementary approaches employing follow-up observations, 
simulations, and joint analysis of overlapping multi- 
wavelength surveys. Follow-up campaigns will provide 
mass estimates based on hydrostatic, virial and lensing 
masses. Simulations of the galaxy distribution will grow 
in fidelity, benefitting from empirical studies of the rela- 
tion between halo mass stellar content to z ~ 1 [13, HH . 
Optical cluster finders applied to such simulated sky ex- 
pectations will inform prior constraints on projection ef- 
fects. Over the next decade, the ability to cross-match 
large cluster samples from mm to X-ray wavelengths will 
offer a new window into the nature of the relationship be- 
tween clusters and the massive halos that host them. 



For the space density and clustering of halos, we follow 
conventions used in previous work j66| . The mass function 
is 



dn 
dM 



'm dM ' 



(Al) 



and we adopt the Tinker parameterization of /(u) [20l |. 



.fi<j) = A 





— a 


h 1 


[(i) 











(A2) 



For fiducial parameters, we adopt the values of [2C 
at mean density contrast A — 200: Aq = 0.186, - 
-0.14. ao = 1.47, = -0.06, bo = 2.57, logio(a) = 
(,^^^)i-2, and c= 1.19. 

The sample covariance of counts TV^ ^ is, given by [s^ 



s: 



Q/3 



d^k 



(A3) 



^W:ik)W,ik)^/P,ik)P,ik), (A4) 



(27r) 



where ba_i{z) is the average cluster linear bias parameter, 
defined as 



bad{z) 



1 



na.iiz) 



dM, 



obs 



M: 



obs 



dM 



x^^jJ^&(M;z)p(AU.|Af). (A5) 



W^*(k) is the Fourier transform of the top- hat window 
function and Pi{k) is the linear power spectrum at the 
centroid of redshift bin i. 

We adopt the b{M, z) fit of (S^l for the halo bias 



5(M, z)^l + 



1 



2Vc 



(5c[l + (a(52/cr2)Pc 



r, (A6) 



and choose the fiducial values for the parameters to be 
a,; = 0.75, (5c = 1.69, and = 0.3. 

Following |82| , we find that the window function W* (k) 
is given by 



\A7(\.\ o i M sin(fc||(5r,/2) Ji(fcj,r,6',) 

^^^^^ = ^''''^^'^\\^''^^^:;^2 — k^;^- ^^^^ 

Here = r{zi) is the angular diameter distance to the i"^ 
redshift bin, and Sr.i ~ r{zi^i) — r[zi). Similarly, Hi = 
H{zi) = H(z), which we assume to be constant inside 
each bin. The variables fcy and k± represent parallel and 
perpendicular components of the wavenumber k relative 
to the line of sight. 
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